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ABSTRACT 


A  land  use/land  cover  identification  methodology  using  LANDSAT  imagery  has  been  applied  to 
six  watersheds  across  the  U.S.  The  land  use  information  is  stored  in  a  grid  cell  data  bank 
and  is  the  basis  for  calibration  of  hydrologic  parameters  for  watershed  models.  Flood 
frequency  studies  have  been  completed  on  four  of  the  watersheds  with  land  use  derived  from 
both  satellite  data  and  conventional  low  altitude  aerial  photography.  This  paper  discusses 
our  experience  using  the  LA1DSAT  land  use  classification  procedure  and  compares  hydrologic 
results  obtained  from  the  alternative  determinations  of  land  use. 

Land  use/land  cover  has  been  found  to  have  a  significant  effect  on  the  quantity,  quality, 
and  timing  of  storm  runoff  from  urban  (and  urbanizing)  drainage  basins.  In  an  attempt  to 
quantify  this  important  relationship,  the  Hydrologic  Engineering  Center  (HEC)  has  interfaced 
state-of-the-art  spatial  data  management  techniques  with  hydrologic  planning  models,  such  as 
HEC-1  and  STORM,  to  simulate  storm  runoff  as  a  function  of  land  use.  The  grid  cell  data  banks 
also  contain  information  on  the  watershed's  environmental,  economic,  and  social 
characteristics,  thus  permitting  a  comprehensive  understanding  of  the  interaction  between  the 
water  resource  system  and  possible  future  urban  developments. 

HEC  has  been  involved  in  a  NASA  ASVT  project  which  tests  and  evaluates  a  procedure 
developed  at  the  University  of  California,  Davis  (UCD)  for  determining  land  use/land  cover 
from  LA1CSAT  imagery.  The  UCD  procedure  was  designed  with  the  objective  of  providing  Corps  of 
Engineers  District  offices  with  an  operational  cost-effective  alternative  to  conventional 
methods  of  obtaining  land  use  data.  A  constraint  on  the  procedure  was  that  it  not  require  the 
use  of  special  image  processing  or  computing  equipment  beyond  that  which  would  normally  be 
available  to  the  field  office;  i.e.,  line  printer,  card  reader,  remote  terminal,  and  access  to 
a  general  purpose  computer.  The  UCD  procedure  consists  of  an  integrated  set  of  computer 
programs  centered  around  an  unsupervised  classification  routine.  Data  quality  check, 
geometric  registration  and  correction,  data  classification,  symbol  map  generation,  resampling 
and  masking  are  all  accomplished  without  the  use  of  an  interactive  color  image  display. 

HEC  has  applied  the  UCD  classification  method  to  Crow  Creek  near  Davemport,  Iowa,  and 
Walnut  Creek  near  Austin,  Texas.  Available  ground  truth  data  permitted  the  identification  of 
seven  land  cover  categories  from  the  LANDSAT  imagery:  agricultural,  residential /highways, 
indust  ria  1/cotnnere  ia  1 ,  grassland,  forest,  undeveloped  open  space,  and  water.  Hydrologic 
simulations  of  four  additional  watersheds,  previously  classified  during  the  development  of  the 
procedure,  were  made,  using  both  conventional  and  LANDSAT  land  use  data.  Resulting  discharge 
frequency  curves  were  compared  to  determine  the  effectiveness  of  IANDSAT  land  use  in 
estimating  "true"  land  use  for  hydrologic  modeling  purposes. 

In  addition,  two  commercial  contractors  provided  a  land  use  classification  of  the  Walnut 
Creek  watershed  based  on  the  same  LAICSAT  scene  to  help  verify  the  accuracy  of  the  UCD 
procedure  with  current  state-of-the-art  classification  methodologies.  Because  the  LANDSAT 
derived  land  use  was  placed  in  data  banks  which  contained  conventionally  classified  land  use, 
detailed  cell  by  cell  comparisons  were  made  between  the  conventional  and  all  of  the  LANDSAT 
land  use  classifications  to  get  an  indication  of  spatial  accuracies  associated  with  LANDSAT 
data  and  the  classification  procedures. 
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Based  on  our  experience  in  using  the  UCD  procedure  and  come rc  ia lly  derived  LANDSAT  data, 
recommendations  are  presented  regarding  the  role  of  remote  sensing  information  in  the  Corps  of 
Engineers  hydrologic  investigations  program. 

LAM) SAT  LAND  USE  FOR  HYDROLOGIC  MODELING 


The  hydrologic  modeling  of  a  watershed,  particularly  urban  or  urbanizing  basins,  requires 
that  the  distribution  of  land  use  be  determined.  The  amount  and  timing  of  runoff  is  directly 
related  to  the  infiltration  capacity  of  a  land  area  with  the  most  important  distinction  being 
between  pervious  and  impervious  land  surfaces.  Water  quality  parameters  have  a  similar 
dependence  on  land  use  data;  rate  of  accumulation  of  a  particular  pollutant  per  unit  area  is 
normally  expressed  as  a  function  of  land  use.  Water  resource  planning  studies  are  interested 
in  not  only  an  assessment  of  the  present  state  of  the  water  and  related  resource  system,  but 
also  its  possible  future  configuration.  By  expressing  hydrologic  parameters  as  a  function  of 
current  land  use  it  becomes  possible  to  rationally  predict  the  impact  future  land  use  changes 
will  have  on  the  quantity  and  quality  of  future  runoff. 

Manual  methods  for  land  use  identification  (e.g.,  interpretation  of  low  altitude  aerial 
photography  and  field  surveys)  are  frequently  used  in  watershed  studies.  With  this  approach, 
the  resource  requirements,  both  money  and  labor,  for  manual  classification  can  be  extensive. 

An  attractive  alternative  is  the  utilization  of  available  remote  sensing  systems  and 
computer-assisted  classification  techniques. 

The  LANDSAT  satellites  have  been  shown  to  have  the  capability  of  providing  land  use  data 
at  acceptable  levels  of  accuracy  for  hydrologic  modeling  purposes,  (Jackson,  1977;  Ragan, 
1975).  LANDSAT  data  is  quicker  and  less  costly  to  obtain  and  interpret  than  low  altitude 
aerial  photography,  provides  repetitive  coverage  of  the  same  area  at  least  every  18  days,  and 
is  available  for  all  United  States  and  many  worldwide  locations.  Additionally,  LANDSAT's 
digital  format  can  be  directly  analyzed  by  several  available  classification  computer  programs, 
and  can  be  resampled  for  automatic  inclusion  in  a  geographic  data  bank. 

UCD  PROCEDURES 


An  operational  procedure  for  land  use  classification  from  IANDSAT  data  has  been  developed 
at  the  University  of  California,  Davis  (UCD)  for  use  by  the  Corps  of  Engineers.  Referred  to 
as  the  UCD  Procedure,  it  was  designed  to  function  under  the  following  constraints: 

(1)  Only  output  equipment  normally  available  in  Corps'  field  offices  (e.g.,  line  printer) 
and  batch-mode  access  to  a  general  purpose  computer  could  be  expected.  This  would  eliminate 
the  need  for  highly  expensive,  dedicated,  interactive  image  processing  facilities. 

(2)  No  additional  software  beyond  that  provided  as  part  of  the  procedural  package  would 
be  required. 

(3)  No  specialized  technical  expertise  in  data  analysis,  computer  programming,  or  remote 
sensing  would  be  required. 

(4)  The  final  classification  would  be  a  usable  product;  i.e.,  one  that  can  conveniently 
be  entered  into  a  grid  cell  data  bank  and  that  will  adequately,  from  a  hydrologic  viewpoint, 
represent  current  land  use  conditions. 

The  UCD  procedure  consists  of  an  organized  set  of  computer  programs  and  manual  operations 
for  the  identification  of  land  use  from  raw  LANDSAT  data.  A  detailed  description  of  the 
procedure  ia  given  by  Algazi  (1979)  and  Meyer  (1978).  What  follows  is  a  brief  outline  of  the 
primary  tasks: 

(1)  Obtain  LANDSAT  Compute  r-Compa  t  ib  le  Tapes  (CCT),  NASA  high  altitude  aerial 
photography,  and  USCS  topographic  maps  for  the  location  and  date  of  interest.  Extract  a 
rectangular  area  of  data  containing  the  watershed  from  the  CCT.  Check  for  radiometric  errors 
in  the  LAMJSAT  digital  data  and,  if  necessary,  correct. 
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(2)  Determine  the  geometric  registration  of  the  LANDSAT  image  with  the  coordinate  system 
of  the  topographic  maps.  LAM5SAT  control  points  are  identified  from  the  output  of  a  UCD 
computer  program  which  enhances  roads  and  water  bodies  found  in  the  LANDSAT  image.  A 
regression  equation,  estimated  from  the  two  sets  of  control  points,  provides  a  transformation 
mechanism  for  going  between  the  image  coordinate  system  and  the  map  coordinate  system, 
Universal  Transverse  Mercator  (UTM). 

(3)  Use  an  unsupervised  clustering  algorithm  to  partition  the  LANDSAT  four-dimensional 
data  space.  Groups  or  "clusters"  are  identified  that  contain  points  with  spectral  reflectance 
values  that  are  similar  to  members  of  the  same  cluster,  and  dissimilar  to  the  points  of  other 
clusters.  The  clustering  program  is  allowed  to  generate  a  maximum  of  30  clusters.  Each  pixel 
in  the  watershed  data  is  assigned  to  a  cluster. 

(4)  Select  from  a  line  printer  map  of  the  cluster  assignments  six  sets  of  adjacent  pixels 
(spatial  groups),  all  belonging  to  the  same  cluster.  Their  corresponding  location  on  the 
topographic  maps  is  determined  using  the  transformation  equation  of  step  (2).  Visual 
translation,  from  the  map  to  the  aerial  photographs,  of  the  spatial  group's  location  permits  a 
land  use  to  be  assigned  to  each  spatial  group.  For  clusters  having  a  consistent  land  use 
assigned  to  all  six  spatial  groups,  a  final  land  use  has  been  determined.  But  for  those 
clusters  where  conflicts  exist  between  the  land  use  identified  with  each  of  the  six  spatial 
groups,  further  partitioning  of  the  data  space  is  required. 

(5)  Clusters  with  conflicting  land  use  assignments  and  cl'sters  whose  associated  land  use 
could  not  be  determined  from  the  available  maps  and  photos  are  reclustered  by  repeating  step 
(3),  and  given  final  land  use  assignments  by  repeating  step  (4). 

(6)  At  this  point  the  watershed  data  file  contains  a  land  use  c  lass  i  ficat  ion  (typically  5 
to  7  categories)  for  all  its  pixels.  The  watershed  file  is  then  resampled  at  the  grid  cell 
centroids  using  a  nearest-neighbor  algorithm.  The  size  of  the  grid  cells  is  usually  line 
printer  compatible  with  the  scale  of  USGS  7-1/2-min.  topographic  maps. 

(7)  The  resanpled  file  is  entered  directly  into  a  grid  cell  data  bank.  Alternatively,  a 
file  containing  the  digitized  (in  UTM  coordinates)  watershed  boundary  can  be  used  to  mask  the 
resampled  file,  leaving  only  the  grid  cells  within  the  boundary.  Total  acreage  of  each  land 
use  class  for  the  entire  watershed  is  then  computed. 

HYDROLOGIC  LAND  USE  COMPARISON 


The  primary  reason  for  examining  the  land  use  classification  ability  of  LANDSAT  was  for 
its  potential  application  to  hydrologic  modeling.  The  computer  program  HEC-1  (U.S.  Army  Corps 
of  Engineers,  1973)  has  the  capability  of  explicitly  relating  land  use  to  runoff  using  two 
procedures:  Snyder's  unit  hydrograph  with  percent  imperviousness,  and  the  SCS  curve  number 
and  unit  hydrograph  (U.S.  Soil  Conservation  Service,  1972).  The  HYDPAR  program  (U.S.  Army 
Corps  of  Engineers,  1978b)  obtains  the  necessary  information  from  a  grid  cell  data  bank  and 
computes  the  specified  hydrologic  parameters,  which  are  in  turn  input  into  an  HEC-1  model  of 
the  basin.  HYDPAR  contains  a  regression  equation  formulation  of  Snyder's  lag  as  a  function  of 
stream  length,  length  to  centroid  of  subbasin,  stream  slope,  and  percent  imperviousness.  A 
table  associating  a  percent  imperviousness  with  each  land  use  category  in  the  data  bank 
enables  HYDPAR  to  compute  subbasin  percent  imperviousness  from  subbasin  land  use  distribution. 

In  a  similar  manner  HYDPAR  can  determine  the  SCS  unit  hydrograph  parameter  from  stream 
length,  basin  average  land  slope,  and  subbasin  average  curve  number.  Curve  numbers  represent 
an  empirical  relationship  between  hydrologic  soil  type,  land  use,  and  their  resultant  runoff 
potential.  From  a  table  identifying  a  curve  number  with  each  combination  of  land  use  and 
hydrologic  soil  type,  HYDPAR  computes  subbasin  average  curve  number. 

Both  hydrologic  modeling  techniques,  Snyder's  unit  hydrograph  with  percent  imperv iousness 
and  the  SCS  curve  number  approach,  were  applied  to  the  land  use  classifications  of  two 
watersheds:  Rowlett  Creek  near  Dallas,  Texas,  and  Pennypack  Creek  in  Philadelphia, 
Pennsylvania.  For  each  watershed  land  use  was  determined  from  LANDSAT  imagery  using  the  UCD 
Procedure,  and  conventional  interpretation  of  low-altitude  aerial  photography. 

The  following  contains  sumnary  results  of  the  hydrologic  land  use  comparison.  Complete 
details  of  the  study  are  reported  in  (U.S.  Army  Corps  of  Engineers,  1979). 
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ROWLETT  CREEK 


A  calibrated  HEC-1  model  of  a  24.6  square  mile  (63.7  Kin^)  portion  of  the  Rowlett  Creek 
basin,  referred  to  as  Upper  Spring  Creek,  was  used  to  simulate  runoff  from  selected  recurrence 
interval  rainfall.  Initial  assignments  of  percent  imperviousness  to  Rowlett  Creek's  LANDSAT 
and  conventional  land  use  categories  permitted  HYDPAR  to  compute  Snyder's  lag  for  the 
twenty-three  subbasins  in  Upper  Spring  Creek.  Nearly  identical  values  of  lag  were  calculated 
from  the  LANDSAT  and  conventional  land  use  data. 

The  calibrated  HEC-1  model  (using  the  two  land  use  estimates  of  Snyder's  lag)  and 
synthetic  rainfall  produced  the  discharge  values  plotted  in  Figure  1  for  selected  stations  in 
the  Upper  Spring  Creek  drainage.  Differences  between  such  discharge  frequency  curves  can  be 
interpreted  as  a  measure  of  the  hydrologic  significance  of  LANDSAT's  misclassi fication  of  land 
use.  Considering  the  uncertainty  involved  in  estimating  a  frequency  curve  (even  from  observed 
data),  the  difference  between  LANDSAT  and  conventional  curves  is  insignificant. 

PENNYPACK  CREEK 

The  SCS  curve  number  method  was  used  to  model  the  Pennypack  Creek  basin  (55.6  .  144.5 

Km^).  Curve  numbers  were  assigned  to  the  LANDSAT  and  conventional  land  use  categories.  For 
each  of  Pennypack  Creek's  sixty-five  subbasins  HYDPAR  calculated  subbasin  average  curve  number 
and  subbas  in  lag.  Once  again,  nearly  identical  values  of  subbasin  lag  were  computed  from  the 
LANDSAT  and  conventional  land  use  data. 

As  an  additional  comparison,  subbasin  average  curve  number  and  lag  were  calculated  for  (1) 
all  land  use  categories  assigned  the  industrial  category  curve  numbers,  and  (2)  all  land  use 
categories  assigned  the  natural  vegetation  curve  numbers.  Parameters  estimated  in  these  two 
cases,  and  the  discharge  frequency  curves  derived  from  them,  demonstrate  the  possible  extremes 
(in  terms  of  runoff)  that  could  have  been  generated  from  the  mu 'el. 

The  calibrated  HEC-1  model  of  Pennypack  Creek  simulated  the  basin’s  discharge  frequency 
behavior  for  conventional,  LANDSAT,  all  industrial,  and  all  natural  vegetation  conditions. 

The  resulting  discharge  frequency  curves  for  the  entire  drainage  area  are  shown  in  Figure  2. 

It  is  clear  from  this  figure,  especially  with  reference  to  what  could  have  been  (i.e.,  all 
industrial  and  all  natural  vegetation  conditions),  that  the  difference  between  LANDSAT  had 
conventionally  derived  frequency  curves  is  not  significant. 

ACCURACY  VS.  SPATIAL  INTEGRITY 

As  previously  mentioned,  in  order  to  be  able  to  conduct  the  hydrologic  modeling 
assessment,  the  LANDSAT  land  use  was  processed  into  existing  geographic  information  system 
data  banks.  These  data  banks  already  contained  an  exhaustive,  spatially  accurate 
representat ion  of  the  land  use  which  was  derived  by  conventional  means.  The  grid  cell  size  of 
these  existing  data  banks  varied  in  size  from  0.74  acres  (Pennypack  Creek)  to  1.148  acres 
(Rowlett  Creek  and  Walnut  Creek)  to  1.53  acres  (Trail  Creek  and  Castro  Valley).  The 
description  of  the  ways  in  idi  ich  the  conventional  land  use  classification  and  the  LANDSAT 
classification  were  derived  is  presented  in  detail  in  HEC's  Research  Note  No.  7  (U.S.  Army 
Corps  of  Engineers,  1979).  Table  1,  UCD  Classification  vs.  Conventional  Land  Use,  shows  a 
summary  of  the  cell-by-cell  comparison  for  the  Walnut  Creek  watershed. 

For  a  cell-by-cell  comparison  of  the  Walnut  Creek  watershed  it  was  necessary  to  establish 
an  explicit  aggregation  of  the  conventional  land  use  categories  into  the  fewer  LANDSAT  land 
use  categories,  Table  2.  The  RIA  computer  program  (U.S.  Army  Corps  of  Engineers,  1978a) 
generated  the  coincident  matrix,  Table  l.  The  structure  of  this  cross  tabulation  table  is 
similar  to  others  that  will  be  presented  later  in  this  paper.  Each  element  of  the  table  (row 
and  column  combination)  refers  to  all  grid  cells  within  the  watershed  data  bank  that  have  the 
concur  rent  LANDSAT  and  conventional  land  use  specified  by  the  row  and  column  headings  of  that 
particular  element.  For  example,  the  2nd  row,  1st  column  of  Table  1  refers  to  all  grid  cells 
in  the  Walnut  Creek  data  bank  that  are  classified  both  commerc  ial/ indust  rial  by  LANDSAT  and 
residential  by  the  conventional  classification.  For  each  element  of  the  table,  four  numbers 
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DISCHARGE  (tOOO  cf« ) 


TABLE  1 


Walnut  Creek  Land  Use  Comparison 
UCD  LAND SAT  vs.  CONVENTIONAL 


Acres* 

Z  Row 

CONVENTIONAL 

Z  Col 

Z  Total 

RES 

COM/IND 

QUARRY 

CROP/ PASTURE 
/RANGE 

FOREST 

WATER 

ROW 

TOTAL 

3801 

1098.8 

225 

3147 

859 

6 

9136 

RES 

41.6 

12.0 

2.5 

34.4 

9.4 

0.1 

100.0 

59.8 

31.6 

22.0 

19.1 

9.4 

9.5 

25.0 

10.4 

3.0 

0.6 

8.6 

2.3 

0.0 

25.0 

471 

816 

130 

366 

HE 

0 

1861 

COM/IND 

25.3 

43.8 

7.0 

19.7 

0.0 

7.4 

23.5 

12.7 

2.2 

0.0 

5.1 

1.3 

2.2 

0.4 

1.0 

0.2 

0.0 

5.1 

61 

111 

17 

21 

0 

223 

UCD 

QUARRY 

49.8 

7.6 

9.4 

0.0 

100.0 

LAND  SAT 

Hi  ■ 

10.9 

O.l 

0.2 

0.0 

0.6 

0.0 

Bfl 

0.3 

0.0 

0.1 

0.0 

0.6 

1581 

1106 

313 

9324 

2947 

28 

15229 

CROP/ 

10.3 

7.2 

2.0 

60.9 

19.3 

0.2 

100.0 

PASTURE 

24.9 

31.8 

30.7 

56.4 

32.2 

44.4 

41.8 

4.3 

3.0 

0.9 

25.5 

. 

8.1 

0.1 

.8 

493 

361 

234 

3655 

5233 

13 

9989 

FOREST/ 

4.9 

3.6 

2.3 

36.6 

52.4 

0.1 

RANGE 

7.8 

10.4 

22.9 

22.1 

57.2 

20.6 

27.3 

1.3 

1.0 

0.6 

10.0 

_ 

14.3 

0.0 

27.3 

0 

31 

8 

■■ 

6 

16 

70 

WATER 

0.0 

44.3 

11.4 

8.6 

22.9 

■TiliMifl 

0.0 

0.9 

0.8 

0.1 

0.1 

25.4 

lEQ 

0.0 

0.  1 

0.8 

0.0 

0.0 

0.0 

U 

COLUMN 

6359 

3473 

1021 

16518 

9144 

63 

36578 

TOTAL 

17.4 

9.5 

2.8 

45.2 

25.4 

0.2 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

17.4 

9.5 

2.8 

45.2 

25.0 

0.2 

100.0 

*To  obtain  hectares,  multiply  acres  by  2.471. 
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TABLE  2 


WALNUT  CREEK 

LAND  USE  CATEGORY  MAPPING 


Conventional  Land  Use 

UCD  LANDSAT 

Bat te lie  LANDSAT 

GE  LANDSAT 

Low  density  single  family  residential 
Medium  density  single  family  residential 
High  density  single  family  residential 
Miltifaraily  residential 

Mob  i  le  home  pa  rk  s 

res identia 1 

res identia 1 

residential  1 
residential  2 

Strip  commercial 

Shopping  centers 
'.nst  itut  iona  1 

Industrial 

Industrial  and  commercial  complexes 

Public  Use:  cemetaries,  public  assembly 
areas,  waste  disposal  areas 
Transportation,  coranunication,  utilities 

commerc  ia  1/ 
industria 1 

_  .  1 

industrial/ 
comnerc  ia  1 
transportation 

urban 

Barren  land /quarry 

barren  land/ 

barren  land 

highly 

reflective 

Cropland 

crop  land / 

cropland/ 

vegetation/ 
cr<  ps 

Pasture /  range  land 

Developed  open  space 

Undeveloped  urban  land 

pasture 

pasture 
range  land 

dark  fields 
open  area 

Forest 

forest  / 
range  land 

forest 

riparian 

woodland 

Water 

wate  r 

water 

_ i 

1 
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are  given:  (1)  total  acreage  of  all  grid  cells  represented  by  the  appropriate  joint  land  use 
classification;  (2)  row  percent,  or  the  precent  of  a  1 1  grid  cells  with  the  given  LANDSAT 
classification  that  have  a  Iso  the  given  conventional  classification;  (3)  column  percent,  or 
the  percent  of  all  grid  cells  with  the  given  conventional  land  use  that  have  also  the  given 
LANDSAT  land  use;  and  (4)  total  percent,  or  the  percent  of  the  ent  ire  watershed  that  has  the 
given  joint  land  use  classification.  Continuing  our  example  above,  471  acres  (1186  Hectares) 
were  found  to  be  classified  both  comme rc  ia  1/ industr ia  1  by  LANDSAT  and  residential  by 
conventional  means.  This  acreage  represents  25. 32  of  all  the  area  (1861  acres)  that  was 
classified  by  LANDSAT  as  conmerc  ia  1/industria  1,  7.4%  of  all  the  area  (6359  acres)  that 
belonged  to  the  conventional  residential  category,  and  1.3%  of  the  total  watershed  area 
(36,578  acres). 

The  far  right  colunn  (row  total)  and  the  bottom  row  (coluim  total)  of  Table  1  give  the 
marginal  distributions  of  LANDSAT  and  conventional  land  use,  respectively.  These  represent 
the  acres  and  percent  in  the  different  land  use  categories  without  the  conditional  requirement 
described  above  for  the  body  of  the  table. 

Figures  3  and  4  show  the  spatial  location  of  both  the  conventional  and  the  UCD  classified 
land  use  vdiich  are  part  of  the  Walnut  Creek  grid  cell  data  bank. 

COMMERCIAL  CAPABILITY 

The  UCD  procedure  provided  completely  acceptable  land  use  percentages  for  the  subbasins 
and  total  watershed  areas  (the  marginal  values  in  Table  1)  which  were  input  into  the 
hydrologic  models.  Spurred  on  by  this  initial  success,  the  HEC  further  participated  in  the 
Water  Management  and  Control  ASVT  by  comparing  conmerc  ia  1  ly  prepared  LANDSAT  land  use 
classifications.  The  objectives  of  HEC's  participation  were:  (1)  to  compare  the  results  of 
the  UCD  procedure  to  the  results  from  conmerc  ial  vendors,  (2)  to  provide  cost  data  to  a 
cost-effectiveness  study  of  LANDSAT  determined  land  use  which  was  part  of  the  ASVT  and,  (3)  to 
evaluate  the  ability  of  commercial  vendors  to  create  a  computer  file  which  could  be  directly 
inserted  as  a  data  variable  into  an  existing  erid  cell  data  bank.  In  order  to  execute  this 
portion  of  the  ASVT,  the  Walnut  Creek  data  bank  was  selected  as  the  test  area  because  (1)  the 
HEC  was  confident  in  the  spatial  integrity  and  accuracy  of  the  conventional  land  use  in  the 
data  bank,  (2)  the  grid  cell  size  was  1.148  acres  which  was  comparable  to  a  pixel  size  of  1.1 
acres  and,  (3)  the  grid  pattern  for  the  study  area  was  orientated  to  align  with  the  UTM 
coordinate  system.  The  commercial  contractors  which  were  selected  to  participate  in  this 
portion  of  the  study  were  General  Electric  and  Bat  te  lie's  Pacific  Northwest  Labratories  in 
Richland,  Washington.  Both  commercial  firms  created  resampled  tapes  which  contained  the  land 
use  value  vAi  ich  corresponded  to  the  centroid  location  of  each  of  the  Corps  rectangular  grid 
cells  (200  feet  x  250  feet).  Table  2  contains  the  commercial  LANDSAT  land  use  classifications 
which  correspond  to  the  conventional  classification.  Tables  3  and  4  show  the  results  of  the 
cell-by-cell  comparisons  for  the  Battelle  and  General  Electric  classifications,  respectively. 

Using  the  diagonal  percentages  in  Tables  1,  3  and  4,  the  overall  cell  by  cell  accuracies 
of  the  UCD  procedure,  Battelle  and  General  Electric  were  53,  44  and  44  percent,  respectively. 
In  evaluating  the  spatial  integrity  of  the  LANDSAT  derived  land  use  from  the  various 
classification  procedures,  there  are  three  potential  sources  of  error:  (1)  differences 
resulting  from  the  land  use,  land  cover  conflict,  (2)  geometric  correction  and  resampling 
error  and  (3)  misc  1  ass  i  f  icat  ion  of  the  pixel  spectral  signatures. 

When  comparing  LANDSAT  and  conventional  land  use  classifications,  it  is  important  to 
recognize  that  the  same  land  cover  can  be  interpreted  differently;  i.e.,  conventional  land  use 
categories  are  not  always  compatible  with  1ANDSAT  land  cover  categories.  For  example,  the 
conventional  category  "transpor  tat  ion/comnunicat  ion/ut  i  I  it  ies  ”  includes  major  highways, 
right-of-way  for  railroads  and  power  transmission  lines,  communication  towers,  airport 
facilities  (including  buildings,  runways,  and  vacant  land  within  the  airport  limits),  and 
sewage  treatment  plants.  In  contrast,  LANDSAT  will  recognize  the  treatment  plant  settling 
tanks  as  "water  bodies",  the  open  fields  surrounding  a  runway  as  one  of  the  vegetation 
categories,  and  right-of-ways  as  whatever  land  cover  class  is  nearby.  Even  though  this  is  a 
problem  with  any  LAtCSAT  land  use  classification,  in  the  Walnut  Creek  watershed  only  a  very 
small  portion  of  the  watershed  had  the  potential  for  this  kind  of  error. 
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TABLE  3 

Walnut  Creek  Land  Use  Comparison 
Bat  te  He  LANDSAT  vs.  Conventional 


Acres* 
Z  Row 
Z  Col 
Z  Total 


CONVENTIONAL 


COM/1ND 


BATTELLE  QUARRY 
LANDSAT 


CROP/ 

PASTURE 

RANGE 


FOREST 


RES 

COM/ 1  NO 

QUARRY 

2723 

976 

70 

41.8 

15.0 

1.1 

45.6 

31.6 

7.8 

2.8 

Hi 

661 

336 

28.9 

14.7 

3.5 

11.1 

10.9 

8.  1 

1.9 

1 .0 

0.2 

212 

160 

111 

15 

11.3 

7.8 

3.5 

5.2 

11.3 

0.6 

0.5 

0.3 

1720 

1137 

517 

9.5 

6.3 

2.9 

28.8 

36.8 

52.9 

5.0 

3.3 

I  .5 

636 

464 

201 

10.  1 

7.4 

3.2 

10.6 

15.0 

20.6 

1 .8 

1.3 

0.6 

23 

13 

0 

42.6 

24.1 

0.0 

0.4 

0.4 

0.0 

0.1 

0.0 

0.0 

5975 

17.2 

100.0 

17.2 

3086 

8.9 

100.0 

8.9 

9  78 

2.8 

100.0 

2.8 

CROP/PASTURE 

/RANGE 


FOREST 


9803 
54.1 
61 .8 
28.3 


2572 

40.9 

16.2 

7.4 


TABLE  4 


Walnut  Creek  Land  Use  Comparison 
G.  E.  LAND  SAT  vs.  Conventional 


Acres* 

X  Row 

X  Col 

X  Total 

CONVENTIONAL 

ROW 

TOTAL 

RES 

COM/IND 

QUARRY 

CHOP/ PASTURE 
/RANGE 

FOREST 

WATER 

3573.7 

1060.8 

205.5 

2308.6 

679.6 

5.7 

7833.9 

RES 

45.6 

13.5 

2.6 

29.5 

8.7 

0.1 

100.0 

57.4 

31.6 

20.8 

14.8 

7.6 

9.2 

22.3 

10.2 

3.0 

0.6 

6.6 

1.9 

0.0 

22.3 

464.90 

693.4 

102.2 

1678.3 

231.9 

3.4 

3174.2 

COM/IND 

14.7 

21.8 

3.2 

52.8 

7.3 

0.1 

100.0 

7.5 

20.6 

10.4 

10.8 

2.6 

5.5 

9.0 

1.3 

2.0 

0.3 

4.8 

0.7 

0.0 

9.0 

358.2 

690. 1 

289.3 

433.9 

106.8 

16.1 

1894.2 

G.E. 

QUARRY 

18.9 

36.4 

15.3 

22.9 

5.6 

0.9 

100.0 

LAND  SAT 

5.8 

20.5 

29.3 

2.8 

1.2 

26.0 

5.4 

1.0 

2.0 

0.8 

1.2 

0.3 

0.1 

5.4 

AGRICUL- 

1587.6 

677.4 

261.8 

8264.6 

5208.5 

13.7 

16013.6 

TURE/ 

9.9 

4.2 

1.6 

51.6 

32.5 

0.1 

100.0 

OPEN  SPACE 

25.5 

20.2 

26.6 

53.1 

58.3 

22.1 

45.6 

4.5 

1.9 

0.8 

23.5 

14.8 

0.0 

45.6 

242.2 

238.9 

127.4 

28  71 .1 

2701.2 

23.0 

6203.8 

WOODLAND 

3.9 

3.9 

2.1 

46.3 

43.5 

0.0 

100.0 

3.9 

7.  1 

12.9 

18.5 

30.3 

37.1 

17.7 

0.1 

0.7 

0.4 

8.2 

7.7 

0.1 

17.7 

0 

0 

0 

0 

0 

0 

0 

WATER 

0 

0 

0 

0 

0 

0 

100.0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

COLUMN 

6226.8 

3360.5 

986.2 

15556.6 

8928.0 

62.0 

35119.6 

TOTAL 

17.4 

9.6 

2.8 

44.3 

25.4 

0.2 

100-0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

17.7 

9.6 

2.8 

44.3 

25.4 

0.2 

100.0 

*To  obtain  hectares,  multiply  acres  by  2.471. 
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A  comparison  at  the  grid  cell  level  can  be  thought  of  as  a  comparison  at  the  pixel  level; 
both  units  of  area  are  nearly  the  same  size.  At  this  scale  errors  introduced  during  the 
geometric  correction  and  resampling  steps  will  be  erroneously  interpreted  as  LANDSAT 
misc lass i ficat ion  errors.  This  will  be  particularly  true  of  land  use  categories  defined  by  a 
small  number  of  adjacent  pixels.  The  border  areas  of  the  larger  land  use  categories  are  also 
susceptible  to  geometric  problems.  In  order  to  assess  the  magnitude  of  the 

registration/resampling  error,  the  RIA  program  was  used  to  calculate  the  linear  distance  each 
grid  cell  was  from  the  nearest  residential  category.  These  distances  were  then  overlaid  on 
the  actual  residential  locations  from  the  three  LANDSAT  classifications.  The  reason  for  this 
overlay  was  that  if  the  major  problem  in  the  cell-by-cell  comparison  was  in  the  geometric 
correction/resampling  procedures,  all  of  the  LANDSAT  residential  areas  should  be  within  one  or 
two  grid  cells  of  an  actual  residential  area.  The  LANDSAT  residential  areas  which  located 
beyond  this  distance  could  be  assumed  to  be  errors  in  misc  lass  if  icat  ion. 

Figure  5,  LAtCSAT  RESIDENTIAL  VS  DISTANCE  TO  ACTUAL  RESIDENTIAL,  is  a  histogram  of  the 
spatial  integrity  of  the  three  classifications  for  residential  land  use  and  is  shown  as  a 
percentage  of  the  residential  area  for  each  classification.  The  figure  shows  that  at  least  60 
percent  of  all  these  classifications  fell  within  2  grid  cells  of  the  ground  truth  residential 
areas.  The  Battelle  classification  was  more  accurate  for  the  residential  areas,  but  when 
compared  for  all  land  use  categories  it  was  less  accurate  than  the  UCD  procedure.  There  were 
at  least  two  reasons  for  Battelle 's  higher  residential  accuracy  when  compared  to  UCD  and  G.E.  ; 
first,  they  had  personnel  from  the  Corps  who  were  very  familiar  with  the  study  area  to  help 
with  the  classification,  vftich  was  not  the  case  in  the  UCD  and  G.E.  classifications  and 
second,  they  had  the  capability  to  predefine  urban  areas  and  reclassify  clusters  vAiich  were 
given  a  residential  value  but  were  located  outside  "known"  residential  areas. 

The  third  source  of  error,  misc  lassi  ficat  ion,  was  looked  at  for  only  the  UCD 
classification.  A  RIA  coincidence  table  was  generated  for  the  conventional  land  use  and  the 
cluster  values  to  see  if  any  of  the  clusters  were  misc  lass  i  fied .  The  HEC  could  not  identify 
any  obvious  misc  1  assi  ficat  ions,  therefore,  it  was  assumed  that  the  remaining  error  (40-60 
percent)  in  the  cell-by-cell  comparison  was  due  to  the  limitations  of  either  the  clustering 
techniques  or  the  satellite  sensors  themselves. 


Flgurs  5 

LANDSAT  RESIDENTIAL  VS  DISTANCE  TO  ACTUAL  RESIDENTIAL 


RESAMPLED  TAPES 


The  resampled  tapes  supplied  by  UCD,  Battelle,  C.E.  and  Bend  ix  (for  the  Castro  Valley 
Watershed)  were  required  to  conform  to  the  following  specifications: 

(1)  The  first  record  on  the  tape  would  be  a  header  record  which  would  give  the 
number  of  rows  and  the  number  of  columns  in  the  data  matrix. 

(2)  The  data  would  be  supplied  a  row  (line)  at  a  time,  for  however  many  number 
of  rows  there  were  in  the  study  area. 

(3)  The  first  land  use  value  would  correspond  to  the  grid  location  1,  1  on  the 
Walnut  Creek  or  Castro  Valley  Basemap. 

(4)  If  the  contractor  windowed  out  the  watershed,  cells  outside  the  study  area 
would  be  given  a  land  use  value  of  -1. 

In  addition,  all  contractors  were  warned  that  if  they  windowed  the  study  area,  to  include 
at  least  some  buffer  area  on  the  resampled  tape  to  make  sure  that  all  grid  cells  in  the 
existing  data  base  would  be  given  a  value. 

The  HEC  was  able  to  process  the  UCD  and  Battelle  tapes  directly  into  the  existing  data 
base.  The  Battelle  tape,  however,  had  several  problems.  After  the  tape  had  been  provided  to 
HEC,  it  was  discovered  that  the  wrong  corner  grid  cell  was  used  in  the  resampling.  All  land 
use  comparisons  of  the  Battelle  classification  in  this  paper  will  be  affected  by  this  known 
resampling  error.  In  addition,  when  the  Battelle  resampled  file  was  entered  into  the  Walnut 
Creek  grid  cell  data  bank  only  34,697  acres  (54.21  sq.  mi.)  were  located  within  the 
geographically  correct  watershed  boundary,  the  latter  being  defined  in  the  data  bank  as 
containing  36,574  acres  (57.15  sq.  mi.).  The  total  classified  land  area  listed  on  Battelle's 
color-coded  map  was  35,869  acres  (56.05  sq.  mi.). 

The  G.E.  tape  was  not  produced  to  the  original  specifications,  so  that  a  new  resampled 
tape  had  to  be  generated,  and  it  also  had  resampling  problems  so  that  it  only  contained  35,120 
acres  (54.9  sq.  mi.). 


CONCLUSION 


The  LAtOSAT  derived  land  use  classification  percentages  are  veil  within  an  acceptable 
error  to  be  used  for  hydrologic  modeling.  The  LANDSAT  derived  land  use  classification  may 
also  be  successfully  placed  into  existing  grid  cell  data  banks  to  be  used  in  other  types  of 
analysis,  such  as  environmental  assessments.  Caution  needs  to  be  made  though  on  the  spatial 
integrity  of  the  LAtOSAT  derived  land  use  for  grid  cell  data  banks  vAiich  have  a  grid  cell  size 
which  is  at  about  the  same  resolution  as  a  pixel.  The  HEC  is  currently  investigating  the 
accuracies  associated  with  a  4.6  and  a  10.3  acre  grid  cell  size  with  the  same  data. 

The  UCD  procedure  was  successful  in  eliminating  the  need  for  expensive  image  processing 
equipment  and  its  accuracy  was  as  good  or  better  than  the  commercial  firms  which  supplied  land 
use  data  for  the  same  area.  This  paper  presents  problems  encountered  with  the  commercial 
products  during  the  conduct  of  HEC's  portion  of  the  ASVT.  The  authors  fully  realize  that 
because  of  continuous  technology  changes  that  these  problems  may  have  been  eliminated. 
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